Automatically Adjusting Content Taxonomies for Hierarchical Classification
نویسندگان
چکیده
Hierarchical models have been shown to be effective in content classification. However, the performance of the model heavily depends on the given hierarchical taxonomy. We empirically show that different taxonomies can result in significant differences in hierarchical classification performance. Motivated by some real application problems, we aim to modify a content taxonomy automatically for different applications. In this work, we formulate the problem, discuss why it is feasible to achieve better performance in terms of classification performance via adjusting a given hierarchy, and present one effective solution to find better hierarchies compared with that of the given original hierarchy. Preliminary experiments on some real world data sets are reported and discussed.
منابع مشابه
Inferring Efficient Hierarchical Taxonomies for MIR Tasks: Application to Musical Instruments
A number of approaches for automatic audio classification are based on hierarchical taxonomies since it is acknowledged that improved performance can be thereby obtained. In this paper, we propose a new strategy to automatically acquire hierarchical taxonomies, using machine learning methods, which are expected to maximize the performance of subsequent classification. It is shown that the optim...
متن کاملAutomatic Construction of Taxonomies of Categories
Hierarchies are an intuitive and effective organization paradigm for data. Of late there has been considerable research on automatically learning a hierarchical organizations of data. In this paper, we formulate the problem of “automatically constructing hierarchical taxonomies”, which we define as learning a hierarchy of categories with no user defined parameters. We propose a framework that c...
متن کاملRapid Induction of Multiple Taxonomies for Enhanced Faceted Text Browsing
In this paper we present and compare two methodologies for rapidly inducing multiple subject-specific taxonomies from crawled data. The first method involves a sentence-level words co-occurrence frequency method for building the taxonomy, while the second involves the bootstrapping of a Word2Vec based algorithm with a directed crawler. We exploit the multilingual open-content directory of the W...
متن کاملAcclimatizing Taxonomic Semantics for Hierarchical Content Classification
Hierarchical models have been shown to be effective in content classification. However, we observe through empirical study that the performance of a hierarchical model varies with given taxonomies; even a semantically sound taxonomy has potential to change its structure for better classification. By scrutinizing typical cases, we elucidate why a given semantics-based hierarchy does not work wel...
متن کاملNotes on hierarchical ensemble methods for DAG-structured taxonomies
Several real problems ranging from text classification to computational biology are characterized by hierarchical multi-label classification tasks. Most of the methods presented in literature focused on tree-structured taxonomies, but only few on taxonomies structured according to a Directed Acyclic Graph (DAG). In this contribution novel classification ensemble algorithms for DAG-structured ta...
متن کامل